A methodology for interschema relationship identification in heterogeneous databases

نویسندگان

  • Venkataraman Ramesh
  • Sudha Ram
چکیده

information shoring among databases, scientific or business, requires the development of techniques for accessing data porn multiple heterogeneous databases. Orie approach to providing interoperability among these databases, is to de$ne one or more federated schemas which represent a coherent view of the underlying databases. A review of existing research on schema integration, the process of generating integrated schemas, points to the need for development of techniques for identifying objects in multiple databases that may be related This process of interschema relationship identification is the focus of this paper. In this paper, we present a methodology for utilizing schematic as well as integrity constraint knowledge for interschsma relationship identification. We present the details of a comprehensive set of heuristics for schematic interschema relationship identification. We also introduce the concept of constraint-based relationships and describe how such relationships can be generated. Finally, we describe heuristics for generating “real world” interschema relationships based on the schematic and constraint-based relationships generated.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Clustering Similar Schema Elements Across Heterogeneous Databases: A First Step in Database Integration

Interschema relationship identification (IRI), that is, determining the relationships among schema elements in heterogeneous data sources, is an important first step in integrating the data sources. This chapter proposes a cluster analysis-based approach to semi-automating the IRI process, which is typically very time-consuming and requires extensive human interaction. We apply multiple cluster...

متن کامل

Clustering Schema Elements for Semantic Integration of Heterogeneous Data Sources

Interschema relationship identification (IRI), that is, determining the relationships among schema elements in heterogeneous data sources, is an important step in integrating the data sources. This article proposes a cluster analysis based approach to semi-automating the IRI process, which is typically very time-consuming and requires extensive human interaction. The authors apply multiple clus...

متن کامل

5 Related Work

12 integration, such as semantic heterogeneity, interschema dependencies, local control over data and processing and transparency. The major contributions of this paper are the following: First we have described motivating examples in federated databases and within each example we have pointed out its critical aspects. These aspects cover a large range of well known problems in the area of fede...

متن کامل

An Information Integration Framework for E-Commerce

because the storage systems lack structural and application homogeneity in addition to a common ontology. The semantic differences generated by a lack of consistent ontology can lead to conflicts that range from simple name contradictions (when companies use different names to indicate the same data concept) to structural incompatibilities (when companies use different models to represent the s...

متن کامل

Model Theoretic Semanticsfor

limited distribution notice This report has been submitted for publication outside of ITC and will probably be copyrighted if accepted for publication. It has been issued as a Technical Report for early dissemination of its contents. In view of thetransfer of copyright to the outside publisher, its distribution outside of ITC prior to publication should be limited to peer communications and spe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1995